Visualizing the Repeat Structure of Genomic Sequences

نویسندگان

  • Nava Whiteford
  • Niall J. Haslam
  • Gerald Weber
  • Adam Prügel-Bennett
  • Jonathan W. Essex
  • Cameron Neylon
چکیده

Repeats are a common feature of genomic sequences and much remains to be understood of their origin and structure. The identification of repeated strings in genomic sequences is therefore of importance for a variety of applications in biology. In this paper a new method for finding all repeats and visualizing them in a two-dimensional plot is presented. The method is first applied to a set of constructed sequences in order to develop a comparative framework. Several complete genomes are then analyzed, including the whole human genome. The technique reveals the complex repeat structure of genomic sequences. In particular, interesting differences in the repeat character of the coding and noncoding regions of bacterial genomes are noted. The method allows fast identification of all repeats and easy intergenome comparison. In doing this the plot effectively creates a signature of a sequence which allows some classes of repeats present in a sequence to be identified by simple visual inspection. To our knowledge this is the first time all exact repeats have been visualized in a single plot that highlights the degree to which repeats occur within a genomic sequence, giving an indication of the important role repeats play. From this it is clear that large scale repeat analysis remains an important and unsolved problem in bioinformatics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data Mining for Identification of Forkhead Box O (FOXO3a) in Different Organisms Using Nucleotide and Tandem Repeat Sequences

 Background: Deregulation of FOXO3a gene which belongs to Forkhead box O (FOXO) transcription factors, can cause cancer (e.g. breast cancer). FOXO factors have important role in ubiquitination, acetylation, de-acetylation, protein-protein interactions and phosphorylation. Understanding the regulation and mechanisms of FOXO3a can lead to cancer treatment. The aim of this study recent association...

متن کامل

Designing Of Degenerate Primers-Based Polymerase Chain Reaction (PCR) For Amplification Of WD40 Repeat-Containing Proteins Using Local Allignment Search Method

Degenerate primers-based polymerase chain reaction (PCR) are commonly used for isolation of unidentified gene sequences in related organisms. For designing the degenerate primers, we propose the use of local alignment search method for searching the conserved regions long enough to design an acceptable primer pair. To test this method, a WD40 repeat-containing domain protein from Beauveria bass...

متن کامل

Genetic Structure of SSR1 & SSR2 loci from Iranian Mycobacterium Avium Subspecies Paratuberculosis Isolates by a Short Sequence Repeat Analysis Approach

Abstract         Background and Objective: Paratuberculosis has been repeatedly reported from Iranian ruminant herds. The extrem fastidious nature of Mycobacterium avium subspecies paratuberculsos hinders genomic diversity studies of the pathogen. Short Sequence Repeat analysis is one of the genome-based approches recently developed to overcome this d...

متن کامل

Visualising the repeat structure of genomic sequences

Repeats are a common feature of genomic sequences and much remains to be understood of their origin and structure. The identification of repeated strings in genomic sequences is therefore of importance for a variety of applications in biology. In this paper a new method for finding all repeats and visualising them in a two dimensional plot is presented. The method is first applied to a set of c...

متن کامل

Microsatellite Isolation and Characterization in Pomegranate (Punica granatum L.)

Development of microsatellite markers has been an increasing trend in crop genetic studies because oftheir applicability in breeding programs. Here we report the development of inter simple sequencerepeat (SSRs) in pomegranate (Punica granatum L.) using an enrichment method that makes use of magneticbeads. Enriched genomic libraries with AG and ATG microsatellite motifs were c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Complex Systems

دوره 17  شماره 

صفحات  -

تاریخ انتشار 2008